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Abstract 

The property of sparse representations concerning capability for information storage is 
discussed. It is shown that this feature can be used, for instance, for an application that 
we term Image Folding. The proposed procedure is applicable by means of any suitable 
transformation. However, it is also the aim of this paper to draw attention in regard to 
q | the gain in the sparsity of an image representation achieved by combination of Discrete 

Cosine a Dirac dictionaries. 

1 Introduction 

o 

The problem of reducing the dimensionality of a piece of data without loosing their information 
content is of paramount importance in signal processing. Well established transforms, from 
classical Fourier and Cosine Transforms to Wavelets, Wavelet Packets, and Lapped Transforms, 
just to mention the most popular ones, are usually applied for generating the transformed 
domain where the processing tasks are realized. Signals amenable to transformation into a data 
set of smaller cardinality are said to be compressible. Natural images, for instance, provide a 



5_i ■ typical example of compressible data. 



In the last fifteen years emerging techniques for signal representation are addressing the mat- 
ter by highly nonlinear methodologies which decompose the signal as a superposition of vectors, 
normally called 'atoms', selected from a large redundant set called a 'dictionary' [1]. The repre- 
sentation qualifies to be sparse if the number of atoms for a satisfactory signal approximation is 
considerably smaller than the dimension of the original data. Available methodologies for highly 
nonlinear approximations are known as Pursuit Strategies. This comprises Bases Pursuit [2] 
and Matching Pursuit like greedy algorithms, including Orthogonal Matching Pursuit (OMP) 
and variations of this method [3-6]. The other ingredient of highly non linear approximations 
is, of course, the dictionary from where atoms are selected. In this respect, Gabor dictionaries 
have been shown useful for image and video processing [7,8]. Combined dictionaries, arising by 
merging for instance orthogonal bases, have received consideration in relation to the theoretical 
analysis of Pursuit Strategies [9-13]. From a different perspective, other approaches are based 
on dictionaries learned from large data sets [14,15]. 

In this Communication we would like to discuss a property intrinsic to sparse representa- 
tions; since that type of representation entails a projection onto a subspace of lower dimen- 
sionality, it generates a null space. This feature suggests the possibility of using the created 
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space for embedding information. In particular, we discuss an application involving the null 
space yielded by the sparse representation of an image, to store part of the image itself. We 
term this application Image Folding. The proposed procedure can be carried out through any 
transformation giving rise to a sparse representation. However, here we focus on a particular 
one. We use mixed dictionaries composed of a Discrete Cosine (DC) dictionary and a discrete 
Dirac basis. Both dictionaries are considered in [2], nevertheless, to the best of our knowledge 
the significant gain achieved by a combination of these dictionaries, in regard to sparse image 
representation, has not been reported. Thus, we would also like to draw attention about the 
suitability of mixed DC and Dirac dictionaries for image representation. As far as sparsity is 
concerned, at the visual acceptable level of 40 PSNR, they may render a significant improve- 
ment of sparsity in relation to established transforms such as Discrete Cosine Transform (DCT) 
and Discrete Wavelets Transform (DWT). An additional advantage of these dictionaries is that 
the OMP methodology for selecting the atoms can be implemented at a reduced complexity 
cost by means of the DCT. 

The paper is organized as follows: In Sec. 2 we discuss the mixed DC-Dirac dictionaries 
and the implementation of the OMP method based on DCT calculations. In Sec. 3 we dis- 
cuss 'something else'. By this we mean to alert that a sparse representation can be used for 
embedding information. Based on such a possibility, a scheme for image folding and a simple 
encryption procedure fully implemented by data processing are discussed. The conclusions are 
presented in Sec. 4. 

2 OMP with mixed 2D DC-Dirac Dictionaries 

Let us consider the set D a defined as 

D a = { Vl ;v l=Pt cos( 7r(2j ~^~ 1) ), j = 1, • • • , N}f =1 , 

with pi, i = 1, . . . , M normalization factors. For M = N this set is a DC orthonormal basis for 
the Euclidean space M. N . For M = 21N, with I a positive integer, the set is a DC dictionary 
with redundancy 21. 

We further consider the set D b) which is a discrete Dirac Basis, i.e., 

Db = {ei G W N with the i-th component equal to 1 and the other components equal to 0}^ 

A redundant dictionary D for R N £g> M N is then obtained as the tensor product D = D a ® D b . 
We denote by d n G M N <g> R N , n = I,..., J, where J = 2MN + M 2 + N 2 , the elements of 
dictionary D and use them to construct the atomic decomposition of an image / G WL N ® ~R N 

aS K 

/ = I>f4, (!) 

i 

where the atoms d^, i = 1, . . . , K are chosen from the dictionary D by the OMP strategy. 

2.1 Dedicated implementation of the OMP approach 

The OMP strategy for decomposing I G M. N x M. N using a dictionary {de t }f =l evolves as follows: 
Setting R 1 = I at iteration k + 1 the OMP algorithm selects the atom, dg k+1 say, as the one 
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minimizing the absolute value of the inner products (di, R k ), i = 1, . . . , J, i.e., 



d ik = arg max | (d i: R k ) | where R k = I - c k d ei . (2) 

2 = 1,. ...J ' ' 



1 = 1 



The coefficients , z = 1, . . . , A: in (121) are such that ||-R fe || 2 is minimum, which is equivalent 
to requesting R k = Ps k I, where Ps k is the orthogonal projection operator onto the span of 
the k selected atoms i.e., Sk = span{<i^}f =1 . We base our implementation for determining the 
coefficients c k , i = 1, . . . , k on recursive biorthogonalization [5,6]. At each iteration we update 
the vectors 

w k+1 = w k - w k +{(de k+1 ,w k ), 1 = 1,...,*, (3) 

where w k +l = g fc+ i/||<?fc+i|| 2 , with q k+1 = d £k+1 -P Sk d £k+1 and q x = d h . The projector P Sk is here 
computed as Ps k = QkQt where the fc-columns of matrix Qk are the vectors qi/\\qi\\, i = 1, . . . , k 
and Ql indicates the transpose conjugate of Qk- However, to calculate the coefficients of the 
linear superposition we express the projectors as Ps k = AkB^ where the fc-columns of matrix 
Ak are the selected vectors and the /c-columns of matrix are the vectors wf, i = 1, . . . , k. 
Thus, the required coefficients arise from the inner products c k = {w k , I), i = 1, . . . , k. The 
code for this type of implementation available is at [16]. 

The complexity of the OMP approach is dominated by the calculation of the quantities 
(o?i, R k ), i — 1, . . . , J in ([2]) at each iteration step. For the present dictionaries these quantities 
can be evaluated by fast DCT. In order to discuss the matter let us re-name the dictionary 
atoms as follows 

forn = 1,...,M 2 d n -> v t <g> v j} i = 1, . . . , M, j = 1, . . . , M 

for ri = .U- • 1 \l- ■ MX d n ^ Vi® e h i = 1,...,M, j = 1,...,N 

for n = M 2 + MN + 1, . . . , M 2 + 2MN d n -> ® v h % = 1, . . . , N, j = 1, . . . , M 
for n = M 2 + 2MN, . . . , J d n ~^ e { ® ej, i = 1, . . . , N, j = 1, . . . , N. 

Hence, the inner products (di, R k ), i — 1, . . . , J are calculated as 

™m r,k, \ ,tt(2s-1)(z-1), .7r(2r-l)(j-l). 
(vi®Vj,B?) = p iPj R k (s,r) cos(^ ^)cos(^ >-) (4) 



s,r=l 
N 



( Vl ®e 3 ,R k ) = Pi J2R k ( S ,j)cos C {2s ^ l) ) (5) 



8=1 

N 



v 3 ,R k ) = Pj J2R%r)cos C {2r ^ 1} ) (6) 



r=l 



(ei^e^R") = R k (i, 3 ) (7) 

If M = iV (ED is the 2D DCT of the residual whilst © and © are the ID DCT of the 
rows and columns of R k , respectively. If M = 21 N, for some positive integer /, the calculations 
can also be carried out through fast DCT by zero padding. Then with the dictionaries we are 
considering the complexity required for evaluation of inner products in (J5J) is 0(M 2 \og 2 M) + 
0(Mlog 2 M). 

The images we have represented with the above outline procedure are the 6 test images 
of Fig. [TJ For the actual processing we divide each image into blocks of 16 <S> 16 pixels. The 
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Figure 1: The six test images from left to right, top to bottom: Boat, Bridge, Film Clip, Lena, 
Mandril, Peppers. 



Image 


DCT2 U Dirac 


DCT 


WT 


Boat 


6.03 


3.63 


3.65 


Bridge 


3.7 


2.06 


2.2 


Film 


8.06 


4.53 


4.8 


Lena 


10.06 


6.5 


6.97 


Mandril 


3.32 


1.91 


1.90 


Peppers 


7.74 


4.36 


3.39 



Table 1: Compression ratio achieved by the mixed DC-Dirac dictionary and those yielded by 
DCT and DWT 
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sparsity measure we use is the compression ratio defined as 

number or total pixels 
divided the number of total coeffients 

In all the cases the number of coefficients are determined so as to produce a PSNR of 40 dB 
by the image reconstruction and the dictionary is a mixed DC redundancy 2 and Dirac basis. 
The results are given in the first column of Table I. For comparison the second column of 
this table shows results produced by fast DCT implemented using the same blocking scheme, 
and for further comparison the results produced by the Cohen-Daubechies-Feauveau 9/7 DWT 
(applied on the whole image at once) are displayed in the last column of Table I. Notice that, 
while for the fixed PSNR of 40 dB the DCT and DWT yield comparable CR the corresponding 
CR obtained by the mixed dictionaries, for all the images, is significantly higher. 



3 Something else 

By the title of this section we try to emphasize the fact that, since a sparse representation in- 
volves a projection onto a lower dimension subspace, it also creates room for storing 'something 
else'. Certainly, if with i^-dictionary's atoms {<i^}^ 1 rendering a sparse representation of an 
image we construct matrix Ak having those atoms as columns and matrix Bk the columns 
of which are the dual atoms {wf}f =1 defined in the previous section, as already mentioned, 
the approximated image is obtained as I K = AkB* k I. Thus the coefficients in the atomic 
decomposition of I K are calculated as the inner products 

cf = «,/), i = l,-..,K. (8) 

The calculation of the duals atoms as in (j3J) guarantees that they span the same space Sk as 
the selected atoms {d^}^ [5]. The sparsity property of a representation implies that Sk is a 
subspace of the image space M N ® ~R N . Hence, by denoting as S^ the orthogonal complement 
of Sk in <8> R w we have R ® R = Sk OS)" 1 S K where ® indicates orthogonal sum. Now, 
if we take a vector F G S^ and add it to the image forming the vector G = I + F to replace / 
in (jHl), since F is orthogonal to the duals {wf.}f = n we still have 

«, G) = «, I + F) — (wl I) = cf , i = 1, . . . , K. (9) 

This suggests the possibility of using the sparse representation of an image to embed the image 
with additional information stored in the vectors F G S^. In order to do this, we apply the 
earlier proposed scheme to embed redundant representations [17], which in this case operates 
as described below. 

Embedding Scheme: Consider that I K = AkB* k I is the reconstruction of a sparse represen- 
tation of an image I. We embed L = N 2 — K numbers hi, i = 1, . . . , L into a vectors F G S%- 
as prescribed below. 

• Take an orthonormal basis Ui, i = 1, . . . , L for S K and form vector F as the linear com- 
bination 

L 

F = ^hiUi. 

i=l 

• Add F to I K to obtain G = I K + F 
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Information Retrieval: 

Given G retrieve the numbers hi, i = 1, . . . , L as follows. 

• Use G to compute the coefficients of the sparse representation of / as in ([9]). Use this 
coefficients to reconstruct the image I K = Yl!i=i c f v i 

• Obtain F from the given G and the reconstructed I K as F = G — I K . Use F and the 
orthonormal basis Ui, i = 1, . . . , L to retrieve the embedded numbers hi, i = 1, . . . , L as 

hi = (ui,F), i = l,...,L 

One can encrypt the embedding procedure simply by randomly controlling the order of the 
orthogonal basis Ui, i — 1, . . . , L or by applying some random rotation to the basis, requiting a 
key for generating it. An example is given in the next section. 

3.1 Application to image folding 

We apply now the above discussed embedding scheme to fold an image. For this we simple 
process the image by dividing it into, say q, blocks I q of N q ® N q pixels. We find the represen- 
tation of each block with the DC Dirac dictionary (in the examples presented here the blocks 
are of 16 <g> 16 and we use a DC redundancy 2 and Dirac basis dictionary). Then we embed the 
coefficients of the sparse representation of some of the blocks into vectors F q , that we add to 
the pixels of the other blocks to obtain G q = I q + F q . The number of blocks we embed depends 
of the sparsity of the representation. The higher the sparsity is the less blocks are needed to 
embed the rest of them, thereby resulting in a more effective folding. 

In this example we have built the orthogonal basis Ui, i — 1, . . . , L by randomly generating 
vectors G R ® R N , i = 1, . . . , L using a 'public' initialization for the random generator. 
We then compute vectors y~i = yi — Ps K Di that we further orthonormalize to have the vectors 
Oj, % = 1, . . . , L. Now, setting an initialization key, which remains unknown for an unauthorized 
user, we apply a random permutation llk ey on vectors o« to finally generate the orthonormal 
basis Ui = n key Oj, i = 1, . . . , L, for embedding the coefficients of the remaining blocks. We 
illustrate the results on the image of Lena, which by a DC-Dirac dictionary has a compression 
ratio of approximately 10, so that we have folded most of the image into the small piece of image 
shown at the top of Fig. [2j The middled picture is the unfolded image obtained without using 
the right security key. Finally the last picture is the unfolded image using the permutation 
matrix initialized by the correct key. In Fig. [3] we present the same process but using the key 
to generate a random rotation of the basis (instead of a permutation). 

Remark 1. Notice that within this framework in order to store the folded image we need to 
quantize the blocks of pixels G q to convert them into integer numbers, which implies some lost 
of information due to the rounding operation. However, the quantization step does not prevent 
us from recovering the coefficients corresponding to the folded pixels with enough accuracy to 
produce a good representation of those blocks of image. The PSNR of the recovered image I K 
(after folding it) is 40.56 while the PSNR of the original image I K is 41.49. 
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Figure 2: The small picture at the top is the folded Image. The middle picture is the unfolded 
image without knowledge of the private key to initialize the permutation. The bottom picture 
is the unfolded picture when the correct key is used. 
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4 Conclusions 



A bonus of sparse image representation has been discussed. Namely the capability for infor- 
mation storage. It was shown that, in particular, this feature of sparse representations can be 
used, for instance, for Image Folding. While the proposed procedure is applicable through any 
transformation, including DCT and DWT, it was illustrated, by a number of examples, that a 
combination of Dirac and DC dictionaries may produce a significant gain in the sparsity of an 
image representation. 

The performance, with regard to sparsity, of highly nonlinear techniques from redundant 
dictionaries encourages the application of image folding, and vise verse. The fact that storage 
and encryption may be implemented by simple processing steps is certainly a very interesting 
feature worth being further investigated. 
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